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ABSTRACT 

We propose a novel technique to refine the modelling of galaxy clusters mass distribution 
using gravitational lensing. The idea is to combine the strengths of both "parametric" and 
"non-parametric" methods to improve the quality of the fit. We develop a multi- scale model 
that allows sharper contrast in regions of higher density where the number of constraints is 
generally higher. Our model consists of (i) a multi- scale grid of radial basis functions with 
physically motivated profiles and (ii) a list of galaxy-scale potentials at the location of the 
cluster member galaxies. This arrangement of potentials of different sizes allows to reach a 
high resolution for the model with a minimum number of parameters. We apply our model to 
the well studied cluster Abell 1689. We estimate the quality of our mass reconstruction with a 
Bayesian MCMC sampler. For a selected subset of multiple images, we manage to halve the 
errors between the positions of predicted and observed images compared to previous studies. 
This owes to the flexibility of multi- scale models at intermediate scale between cluster and 
galaxy scale. The software developed for this paper is part of the public lenstool package 
which can be found at www.oamp.fr/cosmology/lenstool. 
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1 INTRODUCTION 

Non-baryonic dark matter is today commonly accepted as a pre- 
dominant contributor to the matter density of our Universe. Dark 
Matter is indeed required to explain the ve l ocity distribution of 
stars and gas in galaxies (e.g ISaluccill200ll : IdeBlok et al.ll2003h 
or the velocity dis persion of the galaxies in galaxy clusters (e.g. 
ICzoske et al.ll2002h but also to reproduce the large scale galaxy dis- 
tribution fe.g. lGuzzo et al . 2008; Se liak et al .12 005) and the cosmo- 
logic al microwave backgr ound (CMB) fluctuations (e.g. the review 
by iHu & D odelson 2Qo2). A particularly striking demonstration 
of the need of Dark Matt er is the direct detection of Dark Ma t- 
ter in the "Bullet Cluster" dClowe et al.ll200d : lBradac et al.ll2006h a 
cluster merger of 2 massive clusters where the X-ray gas and DM 
are spatially segregated due to the difference in nature of baryons 
and DM. However, beyond this general agreement about the need 
of dark matter, very little is known about its nature. The increas- 
ing amount of observational evidences steadily rules out possibili- 
ties. For instance, th e CMB seems to rule out the warm dark matter 
(Sper gel et al.l l2003) and the measured extension of the truncation 
radius of galaxies in clusters seems to rule out flu id-like, strongly 
interacting dark matter at 5<x dNataraian et al.l2007D . On another as- 
pect, numerical simulations show that the collapsed non collisional 
dark matter forms NFW density profiles jNavarro. Frenk & W hite 
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with a central density peak. In contrast, if self-interacting 
particles a re considered, this peak ca n be replaced by a profile with 
flat core (Spergel & Steinhardt l2000h . On the observational side, 
measuring the slope of the dark matter density profile is still a hot 
topic either at t he galaxy scale, with discrepancies in the r otation 
curves of stars dGentile et al J 120071 : IValenzuela et al J 120070 . or at 
the cluster scale, with discrepancies related to the slope of den- 
sity profiles determined by gravitational lensin g jMeneghetti et al] 
120071 : ISand eTai1l2008l : Limousin et alJl2008h . Finally, numerical 
simulations with self-interacting particles predict less small scale 
halos than simulations with non collisional particles. For a long 
time, the missing satellite problem in the Local Group was consid- 
ered as an evide nce in favor of the se lf-interacting particles hypoth- 
esis. However, ISimon & Gehal d2007h have shown that this prob- 
lem could mere l y be a n observational issue. On the cluster scale, 
iNataraian et al] (120071) has shown that the mass function of sub- 
structures was in agreement with simulations with non-collisional 
particles, at least for a few strong lensing clusters. Estimating the 
mass distribution of cosmological objects with great accuracy is 
therefore a unique way to unveil the nature of dark matter. 

Since the early 90' s, gravitational lensing has appeared as a 
robust tool to model the mass distribution of cosmological ob- 
jects alike galaxies, galaxy clusters and l a rge scale structures 



jects alike galaxies, galaxy clusters ana large scale structures 
dGavazzi et al 1 120071 : Limousin etaD 120071 : iMassev et all 120071 : 
iFu et al.ll2008h . With deep HST/ACS observations of massive clus- 
ters of galaxies, a large number of multiple images have been un- 
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covere d. In particular in the case of Abell 1689, iBroadhurst et al.1 
(2005) were the first to identify more than 100 multiple images part 
of more than 30 multiple images systems. However, in such mas- 
sive clusters, strong lensing modelling has been unable to repro- 
duce the numerous systems of multip le images with less tha n typi- 
cally 1" residual RMS. In Abell 1689. lLimousin et al.U2007h obtain 
an RM S of 2.87" for 34 systems of multiple images, lHalkola et al.l 
d2006h report an RMS o f 2.7" andlBroadhurst et all <2005h an RMS 
of 3.2". In Abell 1703, iLimousin et all d20Q8h obtain an RMS of 
1.45" with 13 system s of multiple images, and in Abell 2218, 
lElfasdottir etail (l2Q09h obtain an RMS of 1.49" with 8 systems. 
The physical origin of this systematical error is not yet fully under- 
stood. Do we miss invisible small scale subhaloes in our models, or 
are we badly reconstructing the large scale mass distribution? 

These large residual errors are likely highlighting a lack of res- 
olution and/or flexibility in the lensing mass models. Indeed, mass 
models traditionally consist of an analytical density profile centered 
with respect to the light distributi on, and fitted to t he positions of 
the multiple images. In addition, iKneib et all dl996h have shown 
that the complementary modelling of galaxy- scale halos hosting 
bright cluster m ember galaxies significantly improves the fit (see 
Julio et al J2007L for a throughout description of the analytical mod- 
elling of clusters). In contrast to traditional "parametric" modelling 
of galaxy clusters, partisans of "non-parametric" models claim that 
their methods may allow a perfect fit; however at the expenses of 
sometimes unphysical solutions. In the case of "non-parametric" 
method, the mass distribution is generally tessel lated into a regu- 
lar gr i d of small elemen ts of mass, call ed pixels (|S aha & Wil liamsl 
ll997l : lDiego et alJl2005h . Alternatively. iBradac et al.l d2005h prefer 
tessellating the gravitational potential because its derivatives di- 
rectly yield the surface density and other important lensing quan- 
tities. Point-like pixels can also be replaced by radial basis func- 
tions (RBF). RBF are real- valued functions with radial symme- 
try. Several density profi les for the RBF have been t ested so far 
iLiesenborgs et al.l ([2007) use Plummer profiles, and iDiego et all 
d2QQ7h use RBF with Gaussian profiles. They also study the prop- 
erties of Power law and Isothermal profiles as well as Legendre 
and Hermite polynomials. They advise to use compact-like pro- 
files such as the Gaussian or the Power law profiles, since too ex- 
tended profiles produce a constant sheet excess in the res ulting sur- 
face m ass d ensity. Finally, in stead of using a regular grid JCoe et al.l 
d2008h and lDeb etail d2008h use the actual distribution of images 
as an irregular grid. Then, they either place RBF on this grid or 
directly estimate the derivatives of the potential at the images loca- 
tion. 

Whatever their implementation, the multiple images reproduc- 
tion is generally greatly improved with respect to traditional "para- 
metric" modelling. However, the qualification of these models is 
still a matter of debate. Indeed, because of their large number of 
free parameters with respect to the number of constraints, many dif- 
ferent models can perfectly fit the data. In order to identify the best 
physically motivated model and eventually learn something on dark 
matter distribution in galaxy clusters, external criteria (e.g. mass 
positivity) or regularization terms (e.g. to avoid unwanted high 
spatial frequencies) are required. In addition, galaxy mass scales 
are never taken into account, although traditional modelling have 
demonstrated that they effectively affect multiple images positions. 
This final step makes such "non-parametric" models a little un- 
certain. Nonetheless, "non-parametric" models are useful because 
their large flexibility allows the exploration of the mass distribution 
regardless of any a priori. For example, these "non-parametric" 



methods are efficient to reveal complex mass distribution such as 
found in the "Bullet Cluster". 

In this article, we study the properties of a model made of 
a multi-scale grid of RBF and a sample of analytically defined 
galaxy- scale DM halos. We analyse how this model compares to a 
traditional "parametric" model. We apply our analysis to the galaxy 
cluster Abell 1689 for its large amount of systems of multiple im- 
ages. In section 1, we present the analytic definition of our RBF. In 
section 2, we evaluate the ability of our multi- scale grid model in 
reproduci ng a simple NFW pro file. In section 3, we use the mass 
model of ILimousin et al.l d2007h as an input to build a multi- scale 
grid model of the galaxy cluster Abell 1689. In section 4, we fit 
this model to a subset of multiple images, and compare the pro- 
duced mass map, deviation angle map and shear map to the ones 
obtained with a traditional model. In addition, we perform an over- 
fitting check. To do so, we assume that if a model optimised with a 
subset of an image catalogue can accurately predict the rest of it, it 
does not over-fit the data. Therefore, we compare the RMS between 
predicted and observed images for the part of the image catalogue 
not used as constraints. Finally in section 5, we study how different 
values of parameters related to the grid building affect the quality of 
the fit, the density profile of the cluster, and the estimated properties 
of the galaxy-scale halos. When necessary, we use the flat ACDM 
concordance cosmology with Q m = 0.3, and H =14 km/s/Mpc. At 
the redshift of Abell 1689 z = 0.184, an angular scale of 1 arcsec 
corresponds to 2.992 kpc. 



2 LENSING EQUATIONS 

The lens equation : 

m = o-<w, (i) 

defines the transformation between the image position 6 and the 
source position 6. a(6 ) is the deflection angle due to the lens (e.g. 
ISchneider etaD(l992h . 

The amplification \i of an image located in 6 is inversely pro- 
portional to the determinant of the amplification matrix A 
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where the amplification matrix A is the derivative of the lens equa- 
tion at the image location 
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here expressed in the amplification basis, k is the convergence and 
y is the shear. 

Through the Fermat Principle, it is possible to demonstrate 
that the deflection angle a(6) is proportiona l to the gradient of 
the tw o-dimensional Newtonian potential felandford & Naravanl 
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which in turn is related to the surface density £ and the convergence 
k through the Poisson relation in 2D 
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I, cr i t is the critical density above which strong lensing is possi- 
ble. D l, D LS and D s are cosmological angular distances between 
the observer O, the lens L and the source S . 
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Table 1. Characteristics of hexagonal grids per level of splitting 



Figure 1. Adopted hexagonal geometry for the multi-scale grid and recur- 
sive splitting into equilateral triangles. 



Deflection angles are additive quantities. For instance, if in a 
cluster we consider N clumps of mass located in 6 U then each of 
them independently deflects a light beam crossing the cluster by an 
angle a t . The total deflection angle computed at an observed image 
position 6 is then 



a(6) = Y j a i (\6-6 i \). 



(6) 



Let us now define the RBF used to build the multi-scale grid. 
Its density profile is given by the analytical expression of the Trun- 
cated Isothermal Mass Distribution (TIMD), which is the circular 
version of the trunc ated Pseudo Isothermal Elliptical Mass Dis- 



tribution (PIEMD) (iKassiola & Kovnerl 1 19931 : iKneib et all 1 19961 : 
iLimousin etal] 120051 : lEKasdottir et al.1 l20Q9h . The analytical ex- 



pression of its surface density is 
X(R) = crlf(R,r core ,r cut ) 
with 

1 r 2 

x ' rut 



2G d, 
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/ defines the profile, and o 2 ^ defines the weight of the RBF. 
Note that this profile is characterised by two c hanges in its slop e 
marked by the r core and r cu , radius (see Fig. 1 in lJullo et all 2007). 
Within r core , the density is roughly constant, between r core and r cut , 
it is isothermal X oc r _1 , and beyond r cut it falls as E oc r~ 3 . Com- 
pared to the Gaussian RBF pro file used bvlDiego et al.1 020071) or the 
Plummer RBF profile used bv lLiesenborgs et all ( 1200% . the TIMD 
profile has a shallower slope in the centre but falls in a steeper man- 
ner after r cu t , thus preventing from the mass sheet excess noted by 
Dieg o et aD d2Q07h with the pure isothermal profile. In addition, the 
TIMD profile is physically motivated. Its total mass is finite as well 
as its central density. In this respec t, this profile is mo re physical 
than the notorious NFW potential dNavarro et al .11 1997b which fits 
non collisional dark matter numerical simulations but has an infi- 
nite c entral density and an infinite total mass (see ILimousin et al.1 
2005, in which TIMD and NFW potentials are compared). Finally, 
thanks to its flat core, the TIMD potential can produce extended flat 
regions, in particular in the centre of clusters if necessary. 



3 THE MULTI-SCALE GRID 
3.1 Definition and motivation 

In this section, we detail how we build the multi- scale model, and 
demonstrate its capabilities at reproducing a singular NFW mass 
profile. For the moment, we do not include lensing constraints. 
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As proposed bv lDiego et al.1 d2005h . we create a coarse multi- 
scale grid from a pixelated input mass map and recursively refine it 
in the densest regions. Doing so, the huge range in mass observed 
in galaxy clusters is efficiently sampled with a minimum number 
of grid pixels. In contrast to many previous works, we do not use 
a squared grid but an hexagonal grid, on the ground that it better 
fits the generally rounded shape of galaxy clusters. With such a 
geometry, it is straightforward to generate a triangular mesh, which 
is the best way to pack a set of radial basis functions. 

In practice, we start by bounding the field of interest with an 
hexagon centred on the cluster centre. We split it into 6 equilateral 
triangles as shown in Fig. [T] Then, we choose a simple splitting 
criteria. We have tested several criteria : total mass, standard de- 
viation or amplitude of surface density variations in a triangle and 
density of constraints, but none of them worked as well as the sur- 
face density threshold. Considering for instance an input mass map 
in a FITS image with pixels of 1 arcsec 2 , a triangle on this image 
is split into 4 sub-triangles if it contains a single pixel (i.e. a region 
of 1 arcsec 2 ) that exceeds a user defined surface density threshold. 
For instance, in order to trigger strong lensing regions, the threshold 
can be set equal to the critical density in M Q I arcsec 2 at the clus- 
ter redshift. Over-critical regions will be split whereas sub-critical 
regions will not. In the extreme case where the mass map is ev- 
erywhere greater than the threshold, it results into a regular grid in 
which the number of triangles increases as 3 x 2 2 " +1 and the number 
of triangle summits, or grid nodes, as N = 1 + 3 X (2 2n + 2 n ), where n 
is the level of recursive splitting. Table [T] summarises for some lev- 
els of splitting the maximum number of triangles and nodes a grid 
can contain. The level of splitting or equivalently the finest grid res- 
olution (~ 2~ n ) is set by the user. As stated above, we have tested 
models where the g rid is refine d at the multiple images location. 
Similarly as in lCoe et all d2008h . such models lead to perfect fits to 
the data. However, we note that they also easily get biased by the 
chosen set of data. 

Finally, RBF described by TIMD potentials are placed at the 
grid node location f . Their core radius is set equal to the size of 
the smallest nearby triangle and their cut radius is set equal to three 
times the core radius (this is discussed in Section [6}. Their weight 
cr 2 are obtained by inverting the following system of N equations 



M n 

M m 
with 



Mi; 



M N 



(9) 



(10) 



Si is the surface density read from the input mass map at the 
grid node location M tj is the value of a RBF with cr 2 = 1, 
centered on the grid node location 0j, and computed at a radius 
R = \6i -0j\. The product M t j cr 2 gives the contribution of this RBF 
to the surface density S t (see Eq.[7]). 
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Figure 2. Multi-scale grid made of 229 RBF, mapping an input mass dis- 
tribution with an NFW profile. The radius of the circles corresponds to the 
core radius of the RBF, i.e. locally the grid resolution. The central grey disk 
of radius 2 x with = 1.54" represents the central grid resolution 
beyond which mass map comparison is meaningless in Fig. [3] 



3.2 Reproducing a NFW profile with a multi-scale model 

With enough resolution multi- scale models can reproduce all sorts 
of input mass distributions, provided of course that no region has a 
slope steeper than £ oc r -3 (steepest slope of the TIMD potential). 
As an exercise, let us consider the input mass map produced by 
a cluster at a redshift z = 0.184 whose density profile follows a 
circular NFW profile with a concentration of 5 and a scale radius 
of 150". We compute a 200 x 200 arcsec 2 pixelated mass map of 
this object (i.e. about an ACS field of view) to be used as input mass 
map. To build the multi-scale grid, we start with an hexagon whose 
centre matches the centre of the NFW mass distribution and whose 
radius is set to 100". We set the splitting threshold to 4.7 10 10 M O 
/arcsec 2 , and limit to n = 6 the number of recursive splitting. Fig.[2] 
shows the produced multi-scale grid. The smallest triangle, i.e. the 
smallest element of resolution, is R 6 = ^j- = 1.54" wide. The grid 
contains 229 nodes, to which we associate RBF with TIMD profile. 
Their weight cr 1 are obtained by inverting Eq.[9] Note however, that 
we prefer an iterative method rather than a direct matrix inversion. 
Indeed, if we directly invert matrix M, we obtain a perfect fit but the 
solution vector co ntains negative erf, i .e. RB F with negative density 
profiles. Although iLiesenborgs et al.1 d2008h allow some RBF to be 
negative, we are more conservative and prevent any element from 
the cr 2 vector from becoming negative. To do so, we minimise the 
following quantity 



Z - ^ (S input (0j) - S P red(0/)) 



(11) 



7=1 



where S in p Ut and S prcd are the input and predicted Si quantities of 
Eq.[9] With this iterative inversion technique, we can force the RBF 
to have positive weights cr 2 , hence make sure that the overall sur- 
face density is positive. This way, we also avoid an additional reg- 
ularisation term (and possible related effects on the final results) 
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Figure 3. Reproduction of the input NFW profile (in black) by a grid with 
6 levels of splitting (solid red line) and 3 levels of splitting (dashed red 
line). The recovered profile with the 6 levels grid is the sum of the 229 
RBF (dotted lines) shown in Fig. [2] The two vertical lines at 2R^ and 2R^ 
mark the lower bounds of validity for the 6 levels grid and the 3 levels grid 
respectively. The dashed vertical line marks the hexagon limit at 100 V5/2 
arcsec. Residuals show that both models present similar levels of errors 
(< 5%) within their respective domain of validity. 



to control the sign of the surface density in the lensing x 2 defined 
below in Eq. [12] 

Fig.[3]shows the reconstructed radial density profile. Note how 
well the arrangement of RBF fits the input NFW density profile. 
The residual is lower than 5% on the meaningful domain from 
twice R 6 to the hexagon inner limit. Note that a similar residual 
is also achieved with 3 levels of splitting, though on the corre- 
sponding meaningful domain. Therefore, a large number of split- 
ting is not expressly justified unless the very centre of the mass 
distribution is of particular interest regarding some strong lensing 
constraints (e.g. radial arcs). 



4 MULTI-SCALE MODEL OF ABELL 1689 

In this section, we continue to study the capabilities of multi- scale 
models. However, in place of an NFW mass distribution, we build 
the grid from a more realistic input mass map based on a simpli- 
fied versio n of the mass model of the galaxy cluster Abell 1689 
reported in lLimousin et al.1 d2007h . In fact, our input model is made 
of 2 cluster-scale and 60 galaxy-scale clumps of mass, instead of 
the 190 galaxy-scale clumps used in Limousin et al. Only mas- 
sive clumps producing a deflection angle larger than 0.07" (i.e. 
roughly the HST/ACS imaging resolution) have been retained. All 
these clumps are described by PIEMD potentials. The galaxy- scale 
clumps are associated to cluster member galaxies. They follow 
scaling relations with similar properties as the one reported in 
Limousin et al. Fig. [5^ shows the convergence map produced by 
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Figure 4. Multi-scale grid made of 120 RBF obtained with 3 levels of 
splitti ng, and based on the input mass map of Abell 1689 dLimousin et al.l 



this model ( with H crit = 2 10 10 M Q /arcsec 2 calculated for a lens at 
redshift zi = 0.184 and a source at redshift infinity). 

Following the same scenario as in the previous section, we 
start building the multi- scale grid by bounding the field of inter- 
est with an hexagon of radius 100" centered on the image centre, 
also the cluster centre. We set the splitting threshold to the criti- 
cal surface density stated above. Doing so, we only oversample the 
central and North-East strong lensing regions, within which k > 1. 
Since the particular distribution and shape of the observed multi- 
ple images depend on the mass distribution in these regions, it is 
important to give them as much flexibility as possible. In a first at- 
tempt, we stop the levels of recursive splitting at 3. We obtain a grid 
of 120 RBF shown in Fig. [4] The size of the smallest triangle hence 
our smallest element of resolution is R3 = 12.3". In comparison, 
iLimousin et al.l d2007h estimate the core radius of the cluster to 33" 
which is more than twice R3. According to the Shannon rule, we 
have therefore enough resolution to model it and reproduce central 
systems of multiple images. The weights cr] of the RBF are com- 
puted by iterative inversion of Eq.[9]as detailed previously. 

Fig. \5\ shows a disagreement between the convergence map 
produced by this model and the input one, above all at the galaxy- 
scale clumps location. Consequently, the multi- scale model under- 
estimates by 9% the mass inside the hexagon. Beyond the hexagon 
limit, note the increasing disagreement between the 2 models. This 
is a modelling artefact of the multi- scale model due to the differ- 
ence in slope between the RBF, which falls as Z oc r -3 , and the 
input profile slope, which falls as E oc r _1 . Of course, such inaccu- 
racies in the convergence map induce serious discrepancies in terms 
of shear and deflection angle. Fig. [5]: shows large errors in the shear 
map at the galaxy- scale clumps location, but smaller errors far from 
these regions. In contrast, Fig. [5jl shows that in the deflection an- 
gle map, discrepancies are weaker, with only 10% of disagreement 
throughout. Note that the discrepancy observed at the very centre 
is due to the lack of resolution of the multi-scale model. Therefore, 
we conclude that the lack of resolution of the multi- scale model at 
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Figure 5. (a) Input convergence map of the galaxy cluster Abell 1689 for 
a source at redshift z s = 3. The hexagon of radius 100", bounds the region 
covered by the multi-scale grid, (b) Relative error map (K gr id - Ki nput ) I Ki nput 
in percent. The white contours correspond to A/c = 0. On average, the 
multi-scale model underestimates the surface density, (c) Relative shear 
error map \y grid - fi np ut\/\finputl (d) Relative defection angle error map 
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the galaxy- scale clumps location severely affects the lensing prop- 
erties of the model. 

Of course, more levels of splitting could improve the recov- 
ery, as demonstrated in the previous section with the NFW profile. 
However here, with 4 levels of splitting, the number of RBF raises 
to 318, and with 6 levels, it amounts to 4287. Optimising grids with 
so many clumps is currently beyond our computational resources. 

4.1 Adding galaxy halos 

Instead of blindly increasing the level of splitting, we better build 
a multi- scale model including both a multi- scale grid and galaxy- 
scale clumps. The grid thus becomes a flexible mass component 
to fit the cluster-scale distribution of mass, whereas galaxy-scale 
clumps fit small-scale irregularities. This appears as a cheap solu- 
tion to our problem of resolution. 



5 STRONG LENSING CONSTRAINTS ON 
MULTI-SCALE MODEL 

5.1 Methodology 

In this section, we investigate the ability at constraining such multi- 
scale models with strong lensing and compare the results in terms 
of mass distribution and image prediction RMS, to a reference 
analytical model. This comparison is performed on the observa- 
tional data of the galaxy cluster Abell 1689. First, we describe the 
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Bayesian MCMC sampler we use to optimise the parameters. Sec- 
ond, we present the multi-scale and the reference models, their free 
parameters, assumed priors, and the strong lensing constraints. Fi- 
nally, we analyse the results. 

5.2 Bayesian MCMC sampler 

Given their large number free parameters and the few number 
of constraints provided by strong-lensing data, "non-parametric" 
models are usually under-constrained, henc e the regularisation 
terms to obtain smooth mass distribution ( Marshall et 
iBradac et all 120051 : ISuvu et al.ll2006h . In contrast, we do not use 
explicit regularisation term because our arrangement of RBF into a 
regular grid, and the smooth and everywhere positive TIMD profile 
describing them, constitute an intrinsic regularisation scheme. In 
addition, equating the TIMD core radius of the RBF to the grid res- 
olution impedes strong discontinuities in the produced mass maps. 
Finally, the overlap of nearby RBF correlates the model's param- 
eters and makes them dependent, thus reducing the effective num- 
ber of free parameters. The number of constraints may then be- 
come larger than the number of effective parameters, and multi- 
scale models may not be under-constrained (see Section^. 

In order to chec k our Bayesia n MCMC sampler (based on 
the BayeSys libra r y dski lling 2004) implemented in lenstoolQ 
dKneib et al.lll993l : Julio et al. 2007)), we generate a mock cata- 
logue of 12 sources and 35 multiple images with the multi-scale 
model described above. Then, we check that the mass distribution 
estimated from this catalogue matches the input one. The^ 2 is com- 
puted in the source plane for simplicity and computation time. For 
each system i containing n t multiple images, we define^ 2 as 

where we use the lensing equation (Eq[T) to compute the source 
position pj of the observed image j, (J3) is the barycentre of the J3j, 
li j is the magnification for image j, and <Tj is the observational error 
at the position of image j. 

Fortunately, we detect no systematics in the recovery. The er- 
ror between the recovered and the input density maps is about 1% 
on average in the hexagon. The most outstanding result is that the 
algorithm converged towards the best fit region with only 3 x 10 8 
X 1 in less than 10 hours on a 2.4 Ghz processor, i.e. it performed 
more than 9000 x 1 1 s on average and computed only x 1 point 
per dimension. Note that a standard gradient method would have 

required at least 3 x 1 points per dimension to find a minimum i.e. 

3122 = 10 58 x i points 



from the input mass map, but still give it a substantial freedom. 
We also add the 60 galaxy-scale clumps retained in section [4] We 
assume that ( i) their position, ellipticity and orientation correspond 
to the brightness profile of their associated galaxy, ( ii) their mass 
is proportional to the total luminosity of the galaxy, i.e. they all 
have the same M/L ratio and (Hi) they are described by PIEMD 
potentials and their PIEMD parameters are linked by the following 
scaling relations 




where L*, r* ore , r* ut and crj are respectively the luminosity and the 
three PIEMD parameters of a typical early-type galaxy at the clus- 
ter redshift. We consider a cluster galaxy with luminosity L* cor- 
responding to an F775W magnitude of 17.5 4. We know from pre- 
vious studies in thi s cluster that r* ut 13" dLimousin et alJl2007L 
Halkola et al. 2007). Still, we assume a flat prior between 1" and 
30" in order to investigate the possible interactions between the grid 
and galaxy-scale clumps. Accordingly, we assume crj vary along a 
wide flat prior defined on the range 10 and 400 km/s. Finally, recent 
studies have shown that the density profile of earl y-type galaxies 
in the field is singular up to observational limits dKoopmans et al.l 
120061 : ICzoske et al.ll2008l) . Therefore, we fix r* ore to 0.03" (i.e. a 
small value but different from zero for numerical reasons). In total, 
this multi-scale model sums 122 free parameters, whose priors are 
reported in Table |2] 

Finally, note that all the parameters used to build the multi- 
scale grid (e.g the input mass map, the hexagonal shape, the 
hexagon size, the splitting algorithm, the level of splitting, the 
r cut I r core ratio and the surface density threshold) can potentially af- 
fect the inferred strong lensing mass distribution, and as such must 
be considered as priors as well. We further discuss this point in sec- 
tion [6] In particular, we look how the RMS of predicted multiple 
images improves with the number of splitting levels. 

5.4 Reference model 

As a reference model, we use a modified version of the Limousin 
et al. model with 2 cluster- scale clumps (for the main and the 
North-East halos), 3 galaxy- scale clumps described by individual 
PIEMD potentials to model the brightest cluster galaxy (BCG) and 
Galaxy 1 and Galaxy 2 that strongly affect systems 6, 24 and sys- 
tems 1, 2 respectively, and the 60 galaxy-scale clumps of the multi- 
scale model described by scaling relations. In total, the reference 
model sums 33 free parameters, whose priors are reported in Ta- 
bled] 



5.3 Multi-scale model and priors 

We consider the multi- scale model of section [4] to which we add 
galaxy- scale clumps. The model then contains a multi- scale grid of 
120 RBF, its smallest element of resolution is R 3 = 12.3" and it has 
a ratio r cu t/r C ore = 3. We fix the profile and positions of the RBF. 
Their weights cr\ are the only free parameters. We allow them to 
vary along a flat prior within the range 0.1 and 1.9 times the values 
found in section [4] by iterative inversion of Eq. [9] As indicative 
values, the minimum and maximum limits we assign are km/s and 
1922 km/s. This way, we prevent the model from wandering too far 

1 publicly available at http : //www . oamp . f r/cosmology/lenstool 



5.5 Strong lensing constraints in Abell 1689 

From Limousi n et al ] d2007h we select a catalogue of 28 images 
in 12 systems of multiple images (see Table |4). We will use the 
rest of the images later to check for model predictability and over- 
fitting. Selected images must have a measured spectroscopic red- 
shift (Richard et al., prep.), or present at least two star-forming re- 
gions clearly visible in all the counter-images, in order to avoid 
identification mistakes. We also remove images whose light is 
blended with the light of nearby galaxies. To get accurate measure- 
ments of the images positions, we fit a Gaussian profile to their 
light distribution with the IRAF task imexam. Because our selection 
mostly contains compact images, we measure a Gaussian width of 
0.13" on the position measurements. We use this positional error 
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Table 2. Priors the multi- scale model 



Nb. clumps r cut /r core 


Level of splitting <x; r* ore 


'cut 




< 


120+60 3 


3 [0,1922] km/s 0.03" 


[1,40]" 


[10,400] km/s 


16 



Table 3. Priors on the reference model 



ID 


R.A. 


Decl. 


e 


e 


r core (kpc) 


r CM ,(kpc) 


cr (km/s) 


Clump 1 


[-15,15] 


[-15,15] 


[0.1,0.55] 


[0,180] 


[30,150] 


1500 


[1000,1700] 


Clump 2 


[-90,-35] 


[5,79] 


[0.4,0.9] 


[0,180] 


[25,90] 


500 


[300,650] 


BCG 


[-10,10] 


[-10,10] 


[0,0.6] 


[0,180] 


[0.1,10] 


[9,550] 


[300,680] 


Galaxy 1 


49.0 


31.5 


[0,0.9] 


[0,180] 


[0.1,30] 


[9,180] 


[150,280] 


Galaxy 2 


[-49,-45] 


[27,35] 


[0,0.9] 


[0,180] 


[0.1,20] 


[4,190] 


[200,580] 


L* elliptical galaxy 










0.15 


[20,60] 


[150,280] 



Notes : Coordinates are given in arcseconds with respect to the BCG (R.A. = 13:11:29 Decl. = -01:20:27). 
The ellipticity e is the one of the mass distribution, expressed as a - b 2 /a 2 + b 2 . 



Table 4. Multiply imaged systems used as constraints 



ID R.A. Decl. (x 2 ) z spe c Image plane rms (arcsec) Image plane rms (arcsec) 













Multi-scale model 


Reference model 


1 






1.91 


3.0 


0.71 


0.67 


1.1 


13:11:26.44 


-1:19:56.37 


0.96 




0.99 


0.94 


1.3 


13:11:29.76 


-1:21:07.31 


0.96 




0.09 


0.16 








1.28 


2.5 


0.28 


0.59 


2.1 


13:11:26.52 


-1:19:55.07 


0.64 




0.40 


0.78 


2.4 


13:11:29.80 


-1:21:05.95 


0.64 




0.08 


0.29 


4.1 






5.07 


1.1 


0.22 


0.67 


4.11 


13:11:32.16 


-1:20:57.33 


1.23 




0.27 


0.85 


4.12 


13:11:30.51 


-1:21:11.90 


0.90 




0.25 


0.89 


4.13 


13:11:30.75 


-1:20:08.01 


1.64 




0.14 


0.19 


4.14 


13:11:26.28 


-1:20:35.06 


1.30 




0.21 


0.50 


4.2 






1.33 


1.1 


0.18 


1.08 


4.21 


13:11:32.10 


-1:20:58.31 


0.66 




0.17 


1.29 


4.22 


13:11:30.65 


-1:21:11.41 


0.66 




0.18 


0.83 


5 






4.35 


2.6 


0.36 


1.01 


5.1 


13:11:29.06 


-1:20:48.41 


2.18 




0.49 


2.18 


5.3 


13:11:34.11 


-1:20:20.87 


2.18 




0.15 


0.49 


6 






1.73 


1.1 


0.17 


0.31 


6.1 


13:11:30.75 


-1:19:37.90 


0.87 




0.21 


0.40 


6.2 


13:11:33.34 


-1:20:11.97 


0.87 




0.11 


0.16 


7 






1.93 


4.8 


0.16 


0.23 


7.1 


13:11:25.44 


-1:20:51.52 


0.96 




0.19 


0.23 


7.2 


13:11:30.67 


-1:20:13.79 


0.96 




0.12 


0.23 


10.1 






0.84 


1.8 


0.10 


0.27 


10.11 


13:11:33.96 


-1:20:50.99 


0.42 




0.09 


0.28 


10.12 


13:11:28.05 


-1:20:12.28 


0.42 




0.10 


0.26 


10.2 






0.68 


1.8 


0.08 


0.18 


10.21 


13:11:33.95 


-1:20:51.54 


0.34 




0.08 


0.20 


10.22 


13:11:28.08 


-1:20:11.84 


0.34 




0.08 


0.17 


10.3 






0.77 


1.8 


0.08 


0.19 


10.31 


13:11:33.96 


-1:20:51.53 


0.39 




0.08 


0.20 


10.32 


13:11:28.08 


-1:20:12.24 


0.39 




0.09 


0.18 


24.1 






3.34 


2.6 


0.17 


0.52 


24.11 


13:11:29.18 


-1:20:56.04 


1.63 




0.18 


0.79 


24.13 


13:11:30.29 


-1:19:33.86 


0.78 




0.20 


0.25 


24.14 


13:11:33.71 


-1:20:19.82 


0.94 




0.13 


0.33 


24.2 






2.92 


2.6 


0.18 


0.40 


24.21 


13:11:29.22 


-1:20:55.28 


0.77 




0.15 


0.38 


24.23 


13:11:30.25 


-1:19:33.26 


1.30 




0.25 


0.55 


24.24 


13:11:33.69 


-1:20:18.80 


0.85 




0.12 


0.21 
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Figure 6. Relative contributions of the PCA eigenvalues to the total vari- 
ance of the samples distribution in the our parameter space with 122 dimen- 
sions. Note that 90% of the variance is reproduced with only 12 effective 
parameters. 



for all the images in the lensing^ 2 (EqfT2l). Since we only use the 
image position as a constraint, we sum 32 constraints. 



5.6 Computational considerations 

On a 2.4 Ghz processor, the Bayesian estimation of the 122 free 
parameters took 15 days to produce about 20000 MCMC samples. 
Although it could be considered as quite computational intensive, 
considering the number of free parameters and the fact that we not 
only find the best fit region but also explore the parameter space in 
its neighbourhood, it is in fact very efficient. In the last section, we 
discuss some issues related to the number of samples. 



5.7 Results 

5. 7. 1 Images prediction 

Results confirm the ability of multi- scale models at being used as 
lens models. Indeed, the RMS per system and per image reported 
in Tableland shown in Fig.[7k highlight the good precision we ob- 
tain. The RMS averaged over all the systems is 0.28". In contrast, 
the reference model optimised with the same catalogue of images 
produces a mean RMS of 0.54". For all the systems unless Sys- 
tem 1, RMS with the multi- scale model are lower than RMS with 
the reference model. System 1 has a slightly larger RMS because 
the galaxy-scale clump located 3.2" to the West may not fit the 
same scaling relations as the other galaxy- scale clumps. Indeed, if 
we assume that 2 galaxies do not follow the same scaling relations, 
there is no perfect solution for the scaling relations parameters. The 
images producing the largest x 2 bias the scaling relations in their 
favor. The fit of images with a lower x 1 but close to galaxies fol- 
lowing other scaling relations worsens. In this respect, note that for 
System 1, % 2 = 1-91 and RMS= 0.71" whereas for System 24.1 



XT 



3.34 and RMS= 0.17". Both systems are at the same distance 



of a galaxy-scale clump. System 24.1 is therefore likely biasing the 
scaling relations in its favor. 

Given the large number of free parameters compared to the 
number of constraints, we could have expected better RMS and 



1.5 



1.0 - 



(a) 



a Optimised images 
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Figure 7. (a) Image plane RMS of the images used as constraints in func- 
tion of their distance to the closest galaxy. In black: RMS obtained with the 
multi-scale model. In red: RMS with the reference model, (b) RMS pro- 
duced by the images kept for the over-fitting check. The two subsets give 
similar RMS, indicating no particular sign of over-fitting with the multi- 
scale model. 



smaller^- 2 . Indeed, it is usually accepted that models with more free 
parameters than constraints allow an infinity of perfect solutions. 
Since our solutions are not perfect, it means that the number of 
useful effective parameters in our model is actually lower than the 
number of constraints. To estimate this number, we analyse the dis- 
tribution of the MCMC samples in the parameter space by means of 
the Principal Component Analysis (PCA) technique. Fig. [6] shows 
that 90% of the variance of the distribution is reproduced with only 
12 effective parameters out of the 122 'real' parameters. With the 
reference model, 90% of the variance is reproduced with only 10 
effective parameters out of the 33 'real' ones. In both cases, the 
number of effective parameters is lower than the 32 constraints, 
hence the non perfect fit. Though very interesting, investigating the 
physical meaning of these effective parameters is out of the scope 
of this paper. 

As a definite confirmation that our multi- scale model is not 
under-constrained and do not over-fit the data, we apply the cross- 
checking technique : with both the multi- scale and the reference 
models, we predict the images positions for the part of the images 
catalogue not used as constraints, and compare the RMS between 
predicted and observed positions. This way, if a model is biased to- 
wards the subset of images used as constraints, it should be unable 
to give accurate predictions. FiglTJ} shows the RMS given the two 
models for the not optimised part of the images catalogue. On av- 
erage, we find an RMS of 3.32" for the multi-scale model and an 
RMS of 3.49" for the reference model. Since the two models give 
similar predictions, we conclude that the multi-scale model does 
not over-fit the data. 

In addition, Fig.|7]3 also shows that when images get closer to 
galaxies their RMS increases. This increasing RMS again suggests 
that galaxy- scale clumps do not perfectly fit the imposed scaling 
relations. There must exists a scatter in the scaling relations that 



images seem to be sensible to, and that should be included in our 
future models. 

5. 7.2 Convergence, deviation and shear maps 

Fig. [8] compares mean convergence, deviation and shear maps pro- 
duced with the reference and the multi- scale models. To produce 
these maps, we compute the convergence, deviation and shear maps 
for each mass model of the MCMC chain, and compute the mean 
maps by averaging the values of each pixel of each map. First of 
all, note that the two convergence maps are very similar. The better 
RMS obtained with the multi- scale model does not originate from 
any particular missing clump in the reference model. The mass en- 
closed at the Einstein radius M(< 45") are also very similar with 
less than 1% difference. In other words, the "mass follows light" 
assumption in traditional modelling holds. Nonetheless, the bet- 
ter RMS obtained with the multi- scale model attests a significantly 
higher degree of flexibility. In particular, we note that in the con- 
vergence map, the North-East clump, at intermediate scale between 
cluster and galaxy scale, looks smoother and more detached from 
the main clump than in the convergence map produced by the ref- 
erence model. The deviation maps produced by the two models are 
also very similar with less than 5% of difference at the images po- 
sition. Finally the errors in the shear maps are mostly lower than 
10% at the images position. This is currently below what we can 
observationally constrain by measuring the ellipticity of multiple 
images. Consequently, the better RMS with the multi- scale model 
is merely due to its large number of free parameters, which allow a 
refined modelling of the mass distribution irregularities at interme- 
diate scale between cluster and galaxy scale. 

5. 7. 3 Error mass map 

In addition to the mean convergence map, the Bayesian approach 
also allows to compute the standard deviation and the S /N conver- 
gence maps. Fig. [9] shows that the mass distribution is estimated 
with high confidence, since the S/N is everywhere larger than 10 
inside the hexagon, i.e. less than 10% error. This means that the 
correlations between the RBF parameters, highlighted by the PC A 
technique, must strongly restrict the range allowed by the priors, 
hence the small error. Furthermore, the smooth iso-mass contours 
confirm the ability of multi-scale models at producing smooth mass 
maps. 



6 DISCUSSION 

In the previous section, we have worked at demonstrating the 
strength of our model. We are now going to highlight some criti- 
cal aspects which will give the reader a more complete view of the 
multi-scale models, but also of "non-parametric" models in gen- 
eral, for which the effects of grid parameters on the final results are 
rarely investigated in details. In this section, we particularly inves- 
tigate two parameters : (i) the r cut /r core ratio defining the RBF con- 
centration and ( ii) the level of splitting. We do not investigate the 
threshold parameter because we only aim at oversampling strong 
lensing regions. Therefore hereafter, in all the models, the surface 
density threshold is set equal to E cnY = 2 x 10 10 M Q /arcsec 2 . 

To begin with, we build 5 multi- scale models similar to the one 
used so far, but with different r cut /r core ratios and levels of splitting. 
Some models built with 3 levels of splitting have 127 clumps be- 
cause the grid has been shifted by 6" with respect to the input mass 
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Figure 9. Map of S/N ratio. Coloured contours bound regions with S/N 
greater than 300, 200, 100 and 10, the highest S/N region being at the centre. 
Red contours are mean iso-mass contours from 5 to 0.77 x 10 12 M Q /arcsec 2 
by steps of 1 and 0.5 x 10 12 M Q /arcsec 2 for the three outer contours. This 
map has been computed with 20000 MCMC samples. Black boxes mark the 
positions of the multiple images used to constrain the mass distribution. 

map. We found that this shift does not affect the final results. We 
optimise each model with our catalogue of 28 images and report 
the mean RMS and mean likelihood (log(L)) = -0.5 (x 1 ) in Ta- 
ble [5] Note also that less MCMC samples have been gathered with 
models A and C in order to evaluate how the number of MCMC 
samples affects the error estimation. 

6.1 The r cut /r core ratio 

The r cut I r core ratio characterises the concentration of the RBF. Mod- 
els with small ratios are more flexible because their RBF are more 
independent. Table [5] shows that such models better fit multiple im- 
ages but can also over-fit them. In contrast, models with larger ra- 
tios produce more extended and overlapping RBF, which worsens 
the fit to multiple images. For instance, model A with r CM? /r core = 2 
(i.e. with very concentrated RBF) gives a better fit than model D, 
with r cut I r core = 10. If we calculate their likelihood ratio we find 
\og(L D /L A ) = 113, which means that model D is clearly ruled out. 

In addition to the fit quality, the r CM? /r core ratio affects the scal- 
ing relations parameters r* ut and crj. Fig. [TOl shows that models 
with large r CMf /r core ratio (i.e. extended RBF) induce large r* ut and 
small crj. To understand this behaviour, Fig. Qj] shows that in such 
models, most of the RBF have a small velocity dispersion, i.e. little 
weight. The mass distribution must then be very smooth with only a 
few very extended and effective RBF. The r* ut extension parameter 
becomes large, in order for the galaxy- scale clumps to accommo- 
date the multiple images and compensate for the lack of effective 
RBF in their neighbourhood. Besides, whatever the r CMf /r core ratio, 
Fig.fTOlshows that the curves of constant M/L ratio agree at 2cr, and 
Table[5]shows that for a given level of splitting, the M ga i/M tot ratios 
are similar. In conclusion, the r CM? /r core ratio affects the extension of 
the galaxy-scale clumps but do not affect their mass. 



10 E. Julio &J.-R Kneib 




0.2 0.4 0.6 0.8 1 1.2 1.4 0.1 0.2 0.3 0.4 0.5 0.6 




30 35 40 45 50 55 60 0.02 0.06 0.1 0.14 0.18 




0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.1 0.2 0.3 0.4 0.5 0.6 0.7 



Figure 8. Comparison between maps obtained with the reference model (left column) and the multi-scale model (middle column). The color images correspond 
to the relative difference |Reference - grid|/|Reference| between the two maps in grey. All the maps are computed for a source plane at redshift zs = 3. The 
red hexagon marks the limit of the multi-scale grid beyond which the maps become meaningless. The black crosses in the color images mark the positions of 
the images used as constraints. 
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Table 5. List of multi- scale models used in this section 



ID 


r 'cut h "core ratio 


Nb of splitting levels 


Nb of clumps 


Nb samples 


M ga i /M tot ratio 


Mean Image plane rms 


<log(L)> 


A 


2 


3 


127 


439 


14% 


0.23" 


-14.8 


B 


3 


3 


120 


20920 


13% 


0.27" 


-13.4 


C 


4 


3 


127 


369 


12% 


0.33" 


-28.2 


D 


10 


3 


127 


25029 


11% 


0.86" 


-228.2 


E 


3 


4 


318 


38399 


5% 


0.22" 


-13.1 


Reference 








1000 


22% 


0.57" 


-59.4 
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Figure 10. Confidence intervals at 68% and 99% of the scaling relations 
parameters r* ut and cr* obtained with models A, C and D. Dashed lines 
show the curves of constant M/L ratio within a 60" aperture (i.e. the mean 
distance from multiple images to galaxy-scale clumps). In black is shown 
the same curve but for the Limousin et al. model. The vertical dashed limit 
at 2/?3 = 24.6" marks the grid resolution of model A. The grid resolution of 
models C and D at 4R?, and IOR3 respectively, are beyond the abscissa scale 
limit of this plot. This figure shows that galaxy-scale clumps efficiently in- 
crease the grid resolution. 



As already noticed bv lDiego et al.1 {2007), extended RBF pro- 
duce a mass sheet excess at large radius. Fig. [T2l shows that model 
D with a large r cut /r core ratio produces a density profile with a shal- 
lower slope than model A. Since with larger ratios, the RBF be- 
come more extended, their cut-off radius is pushed out to larger 
radius. Their isothermal slope is maintained over a wider range in 
radius, making the total density profile shallower. 

6.2 The level of splitting 

Another prior involved in the building of the grid is the level of 
splitting. Fig. [13] shows that in contrast to the r cut /r core ratio, the 
level of splitting does not affect the slope of the density profile. 
The profiles obtained with models B, E and the reference model are 
very similar. In addition, Table [5] shows that models B and E have 
similar RMS and likelihood. Therefore, more levels of splitting is 
not expressly justified given the data. Note also that the reference 
model produces a similar density profile but has an RMS twice the 
RMS obtained with models B or E. Similar density profiles does 
not necessarily mean similar fit to the data. 

Fig-Q3]shows that the level of splitting affects the scaling rela- 
tions parameter r* r With model E, r* ut 15" whereas with model 
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Figure 11. Histograms of the RBF velocity dispersions in models A, B, 
C and D, as distributed before strong lensing optimisation (these distribu- 
tions slightly shrink after optimisation, i.e. small values get larger and large 
values get smaller). Note that multi-scale models are mainly made of RBF 
with small velocity dispersion. As RBF get more extended, more RBF are 
assigned small velocity dispersions. 



B, r* ut 25". In addition, it shows that with model E, r* ut is close 
to the grid resolution 3R4 = 18.4", whereas with model B, it is 
significantly smaller, since 3R 3 = 36.9". It seems therefore that by 
increasing the level of splitting, we just replace the galaxy-scale 
clumps by the RBF of the grid. The large errors in model E indeed 
indicate that the galaxy- scale clumps are not constrained anymore 
by the data. Besides, Fig. [14] and Table \5\ show that the M/L ra- 
tio decreases as the level of splitting increases. The contribution of 
the galaxy-scale clumps to the total mass decreases from 13% in 
model B to 5% in model D. In other words, with 4 levels of split- 
ting, galaxy- scale clumps do not help anymore in increasing the 
model resolution. The scaling relations inflexibility might imped 
them from decreasing in size, to make the model gain in resolu- 
tion and improve the fit. Again, one solution could be to introduce 
a scatter in the scaling relations. We will explore this solution in a 
forthcoming paper. 

6.3 Invariant quantity 

So far, we have found that both the r cut /r core ratio and the level of 
splitting affect the estimation of the scaling relations parameters 
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Figure 12. Convergence profiles obtained with models A, C and D. Larger 
r cu t I r core ratios make the profile shallower. The vertical arrows radially lo- 
cate the strong lensing constraints. 
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Figure 14. Confidence levels at 68% and 99% of the scaling parameters r* ut 
and crj recovered with multi-scale models B (green) and D (blue). Dashed 
lines show the curves of constant M/L ratio within a 60" aperture. Dashed 
limits at and 3/?4 mark the extension of the smallest RBF for models 
B and D respectively. Note that with 4 levels, the galaxy-scale clumps are 
completely unconstrained because they directly compete with the RBF. 
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Figure 13. Density profile comparison between multi-scale models ob- 
tained with 3 and 4 levels of splitting, hence made of 120 and 318 RBF 
respectively, and 60 galaxy-scale clumps. The vertical arrows radially lo- 
cate the strong lensing constraints. In this region, the density profiles are 
similar. Outside, their are mostly driven by the priors. 
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Figure 15. Galaxy-scale clumps mass measurements within a 4" aperture, 
performed on pixelated mass maps resulting from the fit of models A, B, C, 
D and E to the same subset of multiple images. Measurements are very con- 
sistent with a scatter of about 10%, in agreement with the errors observed 
on the radial density profiles. 



r *uv °"o' as we ^ as me rat ^° °f me g a l ax y _sca l e clumps. In 
this context, studying the galaxy- scale clumps physical properties 
seems a little risky. Nevertheless, it is possible to get more reli- 
able values by making measurements directly on the surface den- 
sity maps. In Fig. Q3] we com pute the aperture mass of galaxy- 
scale clumps with Sextractor feertin & Arnouts|[l996l) and pixe- 



lated mass maps obtained for models A, B, C, D and E. We choose 
Sextrator because it affords a multi-threshold algorithm to assign 
the mass in a pixel to the most credible of two nearby galaxies. 
We find that the masses enclosed in a 4" aperture (i.e. the smallest 
distance between two nearby galaxy- scale clumps) are almost unaf- 
fected by the grid parameters. The scatter is of the order of 10%, in 
agreement with the errors found in the density profiles. Estimating 
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Figure 16. Evolution of the cumulated relative error of the parameters of 
model B in function of the number of MCMC samples. The error does not 
stabilise even after more than 20000 samples. 



the galaxy- scale clumps properties directly from the surface den- 
sity maps seems therefore a more reliable and better constrained 
solution (given the priors) than simply considering the r* ut and cr* 
modelling parameters. 



6.4 Errors analysis 

As already stressed, given the size of the parameter spaces, it is en- 
couraging to see our Bayesian MCMC sampler converging to the 
best fit region in less than 5000 samples. However, Fig. ITU shows 
that the error bars increase between model B and D. Since model 
D sums twice as many samples as model B, it seems that the esti- 
mation of the errors depends on the number of computed samples. 
To study the evolution of the errors estimation with the number of 
MCMC sample with the number of MCMC sampless, we define 
the cumulated relative error quantity computed over all the free pa- 
rameters of the model as 



Cumulated parameters error = 




where cr[X,-] and E[X t ] are the standard deviation and the mean 
value of the X t MCMC random variable for parameter i. Fig. Q21 
shows that this error has not converged even after more than 20000 
samples. Similarly, we define the error on the density profile as 



Density profile error = 




where Si is a random variable for the radial density profile inte- 
grated on the range to 100". In contrast to the error on the free 
parameters, Fig. [IT] shows that the error on the density profile con- 
verges after only 1500 samples. Therefore, accurate errors on the 
mass distribution are fast to compute, whereas errors on the param- 
eters of the model, such as r* ut or cr* for instance, are difficult to 
estimate accurately and might actually be underestimated. 
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Figure 17. Evolution of the relative error on the radial density profile of 
model B in function of the number of MCMC samples. The error stabilises 
soon after 1500 samples. 

7 CONCLUSION 

In this paper, we present a multi- scale model sufficiently flexible to 
reproduce the observed systems of multiple images with high ac- 
curacy, but also robust against over-fitting. The model combines a 
grid of radial basis functions (RBF), and galaxy-scale clumps host- 
ing cluster member galaxies, described by PIEMD potentials and 
scaling relations. 

We apply this model to the galaxy cluster Abell 1689. We 
constrai n the model with a su bset of multiple images extracted 
from the lLimousin et al.l d2007h catalogue. We obtain an RMS be- 
tween observed and predicted positions of 0.28" in the image plane, 
i.e. about half the RMS obtained with a slightly modified version 
of the Limousin et al. model (the reference model). We confirm 
predictability of our multi-scale model by cross-checking the op- 
timised model with the part of the images catalogue not used as 
constraints. We find similar RMS with the multi- scale and the ref- 
erence models. This confirms that we do not overfit the data, despite 
the large number of free parameters. We propose a Principal Com- 
ponent Analysis (PCA) technique to estimate the effective number 
of free parameters. We find that our multi- scale model with 122 
parameters contains only 12 effective parameters. Our multi-scale 
model produces smooth mass maps and radial density profiles. 

Then, we compare the convergence, deviation and shear maps 
obtained both with the multi- scale and the reference models. We 
find no major difference between the two sets of maps. This means 
that the better RMS is due to minor changes allowed by the flexi- 
bility of the multi- scale model at intermediate scale between clus- 
ter and galaxy scale. We also note that the galaxy-scale clumps 
are better integrated to the North-East cluster- scale clump with the 
multi-scale grid than with the traditional reference model. The bet- 
ter modelling of this cluster- scale component, embedded in projec- 
tion in the main cluster core, thus illustrates how multi- scale mod- 
els can reproduce irregular mass distributions. 

Finally, we study how changes on the grid parameters affect 
the density profile, the scaling relations parameters crj, r* ut , and 
the M/L ratio of the galaxy- scale clumps. We find that galaxy- 
scale clumps efficiently increase the resolution of the grid of RBF 
without modifying the density profile. Excessively raising the level 
of splitting in order to increase the grid resolution is thus not ex- 
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pressly justified. We also find that the scaling relations parame- 
ter r* ut , related to the galaxy- scale clumps extension, decreases as 
the RBF get more concentrated. This degeneracy leads to several 
models producing the same radial density profile. Fortunately, the 
likelihood is also significantly affected, hence allowing an effective 
model selection. In spite of this degeneracy between the model's 
parameters, we propose a reliable solution to measure the galaxy- 
scale clump mass into an aperture directly on the mass maps pro- 
duced by the optimised models. Indeed, we note that the model's 
parameters degenerate, but the resulting mass distributions are little 
affected by the priors on the grid. 

This work raises the issue of accurat ely measuring sub- 
structure properties in galaxy clusters ( lHalkola et al.l 120071 ; 
iNataraian etal]l200l [Smith & Tavlodl2008h . The degeneracy be- 
tween galaxy-scale and cluster-scale mass components can poten- 
tially lead to spurious conclusions. In traditional modelling of ob- 
servationally quiet clusters, whatever the priors on the adopted pro- 
file for the cluster-scale component (e.g. PIEMD or NFW profile), 
the substructure properties are little affected, given the constraints. 
However, in unrelaxed perturbed clusters, pursuing this study with 
the same technique appears to be more risky. Our multi- scale model 
as described above offers an interesting solution. 

In terms of computation time, this method is still slow. We 
could make it faster by restricting the number of gradients and 
Laplacians calculated per image to solely the most significant. 
However, this has to be treated with care since the sum of negligi- 
ble gradients can become significant. Missing such gradient s could 
lead to spurious results. Look for instance Ibeb et al.l d2008h to un- 
derstand how complex it is to accurately select the relevant clumps 
for gradient calculation. 

Multi- scale models open interesting avenues for the modelling 
of galaxy clusters. Indeed, today standard parametric methods are 
facing their own limitations. New lensing resul t s reveal irregu- 
lar or merging clus ters (e.g. iBradac et al.l 120061 ; |jee et "aLl l200l 
iMahdavi et al ll2007l) . Multi-scale models with RBF afford the ro- 
bustness of parametric methods as well as the flexibility of non- 
parametric methods. Since we use analytical potentials, the exten- 
sion of the method to weak lensing is straightforward. We are cur- 
rently working at combining strong and weak lensing signal to ex- 
tend the accuracy of our modelling to larger radius. In inner regions, 
the strong lensing constraints will need a dense network of RBF to 
be reproduced accurately whereas in the outskirts, a coarser sam- 
pling will perfectly fit with the lower density of weak lensing con- 
straints (see Julio et al. in prep). This method can also be extended 
to multiplane lensing, allowing thus tomographic analysis. 



The authors thank Phil Marshall, Harald Ebeling, Marceau 
Limousin and Johan Richard, as well as the referee for enlighten- 
ing and fruitful comments. JPK and EJ thank support from CNRS 
and SL2S grant 06-BLAN-0067 from the Agence National de la 
Recherche. 



REFERENCES 

Bertin E., Arnouts S., 1996, Astronomy & Astrophysics Supple- 
ment, 117, 393 
Blandford R., Narayan R., 1986, ApJ, 310, 568 



Bradac M., Clowe D., Gonzalez A. H., Marshall P., Forman W, 
Jones C, Markevitch M., Randall S., Schrabback T., Zaritsky D., 
2006, ApJ, 652, 937 

Bradac M., Schneider P., Lombardi M., Erben T., 2005, A&A, 
437, 39 

Broadhurst T., Bemtez N., Coe D., et al 2005, ApJ, 621, 53 
Clowe D., Bradac M., Gonzalez A. H., Markevitch M., Randall 

S. W, Jones C, Zaritsky D., 2006, ApJ, 648, L109 
Coe D., Fuselier E., Bemtez N., Broadhurst T., Frye B., Ford H., 

2008, ApJ, 681,814 
Czoske O., Barnabe M., Koopmans L. V. E., Treu T., Bolton A. S., 

2008, MNRAS, 384, 987 
Czoske O., Moore B., Kneib J.-R, Soucail G., 2002, A&A, 386, 

31 

de Blok W. J. G., Bosma A., McGaugh S., 2003, MNRAS, 340, 
657 

Deb S., Goldberg D. M., Ramdass V. J., 2008, The Astrophysical 
Journal, 687, 39 

Diego J. M., Protopapas P., Sandvik H. B., Tegmark M., 2005, 

MNRAS, 360, 477 
Diego J. M., Tegmark M., Protopapas P., Sandvik H. B., 2007, 

MNRAS, 375, 958 
EKasdottir A., Limousin M., Richard J., Hjorth J., Kneib J.-R, 

Natarajan P., Pedersen K., Julio E., Paraficz D., 2009, ArXiv e- 

prints 

Fu L., Semboloni E., Hoekstra H., Kilbinger M., van Waerbeke 
L., Tereno I., Mellier Y., Heymans C, Coupon J., Benabed K., 
Benjamin J., Bertin E., Dore O., Hudson M. J., Ilbert O., et al. 
2008, A&A, 479, 9 

Gavazzi R., Treu T., Rhodes J. D., Koopmans L. V. E., Bolton 
A. S., Buries S., Massey R. J., Moustakas L. A., 2007, ApJ, 667, 
176 

Gentile G., Tonini C, Salucci P., 2007, A&A, 467, 925 

Guzzo L., Pierleoni M., Meneux B., Branchini E., Le Fevre O., 
Marinoni C, Garilli B., Blaizot J., De Lucia G., Polio A., Mc- 
Cracken H. J., Bottini D., Le Brun V., Maccagni D., Picat J. P., 
Scaramella R., et al. 2008, Nature, 451, 541 

Halkola A., Seitz S., Pannella M., 2006, MNRAS, pp 1133-+ 

Halkola A., Seitz S., Pannella M., 2007, ApJ, 656, 739 

Hu W, Dodelson S., 2002, ARA&A, 40, 171 

Jee M. J., Ford H. C, Illingworth G. D., White R. L., Broad- 
hurst T. J., Coe D. A., Meurer G. R., van der Wei A., Bemtez 
N., Blakeslee J. P., Bouwens R. J., Bradley L. D., Demarco R., 
Homeier N. L., Martel A. R., Mei S., 2007, ApJ, 661, 728 

Julio E., Kneib J.-R, Limousin M., EKasdottir A., Marshall P. J., 
Verdugo T., 2007, New Journal of Physics, 9, 447 

Kassiola A., Kovner I., 1993, ApJ, 417, 450 

Kneib J.-R, Ellis R. S., Smail I., Couch W. J., Sharpies R. M., 
1996, ApJ, 471, 643 

Kneib J. P., Mellier Y., Fort B., Mathez G., 1993, A&A, 273, 367 

Koopmans L. V. E., Treu T., Bolton A. S., Buries S., Moustakas 
L. A., 2006, ApJ, 649, 599 

Liesenborgs J., de Rijcke S., Dejonghe H., Bekaert P., 2007, MN- 
RAS, 380, 1729 

Liesenborgs J., de Rijcke S., Dejonghe H., Bekaert P., 2008, MN- 
RAS, 386, 307 

Limousin M., Kneib J.-R, Natarajan P., 2005, MNRAS, 356, 309 
Limousin M., Richard J., Julio E., Kneib J. P., Fort B., Soucail 
G., EKasdottir A., Natarajan P., Ellis R. S., Smail I., Czoske O., 
Smith G. P., Hudelot P., Bardeau S., Ebeling H., Egami E., Knud- 
sen K. K., 2007, ApJ, 668, 643 
Limousin M., Richard J., Kneib J.-R, Brink H., Pello R., Julio 



Multi-scale cluster lens mass mapping 

E., Tu H., Sommer-Larsen J., Egami E., Michalowski M. J., Ca- 

banac R., Stark D. P., 2008, A&A, 489, 23 
Mahdavi A., Hoekstra H., Babul A., Balam D. D., Capak P. L., 

2007, ApJ, 668, 806 
Marshall P. J., Hobson M. P., Gull S. E, Bridle S. L., 2002, MN- 

RAS, 335, 1037 

Massey R., Rhodes J., Ellis R., Scoville N., Leauthaud A., 

Finoguenov A., Capak P., Bacon D., Aussel H., Kneib J.-P, 

Koekemoer A., McCracken H., Mobasher B., Pires S., Refregier 

A., Sasaki S., Starck J.-L., Taniguchi 2007, Nature, 445, 286 
Meneghetti M., Argazzi R., Pace R, Moscardini L., Dolag K., 

Bartelmann M., Li G., Oguri M., 2007, A&A, 461, 25 
Natarajan P, De Lucia G., Springel V., 2007, MNRAS, 376, 180 
Navarro J. R, Frenk C. S., White S. D. M., 1997, ApJ, 490, 493 
Saha P, Williams L. L. R., 1997, MNRAS, 292, 148 
Salucci P, 2001, MNRAS, 320, LI 

Sand D. J., Treu T., Ellis R. S., Smith G. P, Kneib J.-P, 2008, 
ApJ, 674,711 

Schneider P, Ehlers J., Falco E. E., 1992, Gravitational Lenses. 
Gravitational Lenses, XIV, 560 pp. 112 figs.. Springer- Verlag 
Berlin Heidelberg New York. Also Astronomy and Astrophysics 
Library 

Seljak U., Makarov A., McDonald P., Anderson S. R, Bahcall 

N. A., Brinkmann J., Buries S., Cen R., Doi M., Gunn J. E., 

Ivezic Z., Kent S., Loveday J., Lupton R. H., Munn J. A., Nichol 

R. C., et al. 2005, Phys. Rev. D, 71, 103515 
Simon J. D., Geha M., 2007, ApJ, 670, 313 
Skilling J., 2004, BayeSys and Masslnf 
Smith G. P., Taylor J. E., 2008, ApJ, 682, L73 
Spergel D. N., Steinhardt P. J., 2000, Physical Review Letters, 84, 

3760 

Spergel D. N., Verde L., Peiris H. V, Komatsu E., Nolta M. R., 

Bennett C. L., Halpern M., Hinshaw G., Jarosik N., Kogut A., 

Limon M., Meyer S. S., Page L., Tucker G. S., Weiland J. L., 

Wollack E., Wright E. L., 2003, ApJS, 148, 175 
Suyu S. H., Marshall P. J., Hobson M. P., Blandford R. D., 2006, 

MNRAS, 371, 983 
Valenzuela O., Rhee G., Klypin A., Governato R, Stinson G., 

Quinn T., Wadsley J., 2007, ApJ, 657, 773 



